Landmark based recognition of stops: acoustic attributes versus smoothed spectra

Authors

  • Veena Karjigi
  • Preeti Rao
Abstract

Landmark-based recognition of unvoiced word-initial stops is investigated. The relative effectiveness of acoustic-phonetic attributes versus more global spectral shape features is experimentally evaluated for four-way place classification of unvoiced, unaspirated stops. Various feature sets derived from the burst and vocalic transition regions of word-initial consonants are compared via GMM-based classification under speaker, gender, and vowel-context variability. While a set of acoustic attributes derived from the burst shows the best invariance to vowel context, global spectral shape features provide the most robust representation of the vocalic transition region because they avoid the errors of explicit formant tracking. A combination of features from the burst and vocalic regions was superior to burst-only cues, but still far from the near-perfect identification achieved in human perception.
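The abstract names GMM-based classification but gives no implementation details, so the following is only a minimal sketch of the general technique: one diagonal-covariance Gaussian mixture is fitted per place class with EM, and a token is assigned to the class whose mixture gives the highest log-likelihood. Everything specific here is an assumption made for illustration and is not taken from the paper: the placeholder class labels (`place_1` to `place_4`), the two-dimensional synthetic features standing in for burst or transition measurements, the two mixture components, and the helper names `fit_gmm`, `gmm_loglik`, `classify`, and `make_synthetic`.

```python
import math
import random


def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at vector x."""
    return sum(-0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
               for xi, m, v in zip(x, mean, var))


def logsumexp(vals):
    """Numerically stable log(sum(exp(v) for v in vals))."""
    top = max(vals)
    return top + math.log(sum(math.exp(v - top) for v in vals))


def fit_gmm(data, n_comp=2, n_iter=25, seed=0, var_floor=1e-3):
    """Fit a diagonal-covariance GMM with EM; returns (weights, means, variances)."""
    rng = random.Random(seed)
    n, dim = len(data), len(data[0])
    # Initialise means from random training points, variances from the global variance.
    means = [list(p) for p in rng.sample(data, n_comp)]
    gmean = [sum(x[d] for x in data) / n for d in range(dim)]
    gvar = [max(sum((x[d] - gmean[d]) ** 2 for x in data) / n, var_floor)
            for d in range(dim)]
    variances = [list(gvar) for _ in range(n_comp)]
    weights = [1.0 / n_comp] * n_comp
    for _ in range(n_iter):
        # E-step: posterior responsibility of each component for each point.
        resp = []
        for x in data:
            logp = [math.log(weights[k]) + log_gauss_diag(x, means[k], variances[k])
                    for k in range(n_comp)]
            norm = logsumexp(logp)
            resp.append([math.exp(lp - norm) for lp in logp])
        # M-step: re-estimate weights, means, and floored variances.
        for k in range(n_comp):
            nk = sum(r[k] for r in resp) + 1e-10
            weights[k] = nk / n
            means[k] = [sum(r[k] * x[d] for r, x in zip(resp, data)) / nk
                        for d in range(dim)]
            variances[k] = [max(sum(r[k] * (x[d] - means[k][d]) ** 2
                                    for r, x in zip(resp, data)) / nk, var_floor)
                            for d in range(dim)]
    return weights, means, variances


def gmm_loglik(x, model):
    """Log-likelihood of x under a fitted GMM."""
    weights, means, variances = model
    return logsumexp([math.log(w) + log_gauss_diag(x, m, v)
                      for w, m, v in zip(weights, means, variances)])


def classify(x, models):
    """Return the class label whose GMM assigns x the highest log-likelihood."""
    return max(models, key=lambda label: gmm_loglik(x, models[label]))


def make_synthetic(n_per_class, seed):
    """Synthetic 2-D 'features' for four placeholder place classes (not real speech data)."""
    rng = random.Random(seed)
    centers = {"place_1": (0.0, 0.0), "place_2": (4.0, 0.0),
               "place_3": (0.0, 4.0), "place_4": (4.0, 4.0)}
    return {label: [[rng.gauss(cx, 0.7), rng.gauss(cy, 0.7)]
                    for _ in range(n_per_class)]
            for label, (cx, cy) in centers.items()}


train = make_synthetic(60, seed=1)
test = make_synthetic(20, seed=2)
models = {label: fit_gmm(points) for label, points in train.items()}
correct = sum(classify(x, models) == label
              for label, points in test.items() for x in points)
accuracy = correct / sum(len(points) for points in test.values())
print(f"accuracy on synthetic data: {accuracy:.2f}")
```

In a setup like the one the abstract describes, the synthetic vectors would be replaced by feature vectors measured around the stop landmark, and separate models could be trained on burst features, transition features, or their concatenation to compare feature sets. The accuracy printed here reflects only the well-separated synthetic clusters and says nothing about real stop classification.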

Similar resources

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

Classification of stop consonant place of articulation

One of the approaches to automatic speech recognition is a distinctive feature-based speech recognition system, in which each of the underlying word segments is represented with a set of distinctive features. This thesis presents a study concerning acoustic attributes used for identifying the place of articulation features for stop consonant segments. The acoustic attributes are selected so tha...

Guaraní Voiceless Stops in Oral versus Nasal Contexts: An Acoustical Study

This acoustic study investigates voiceless stops in Guaraní that are described as transparent to nasal harmony. Voiceless stops in oral versus nasal contexts are examined in relation to theoretical issues of locality and phonetic implementation. First, the oral/nasal and voicing properties of the stops are considered in connection to proposals in phonological theory that feature spreading produ...

Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition

This paper presents an attempt to introduce unvoiced landmarks into statistical continuous speech recognition system. The unvoiced landmark detection algorithm proposed here locates the points in speech where the vocal folds stop or begin freely vibrating. In our experiments, 87.47% of stops and 98.94% of fricatives are segmented from speech after the unvoiced landmark detection, with a very lo...

Classification of Stop Consonant Place of Articulation: Combining Acoustic Attributes

This study evaluates the classification of stop consonant place of articulation in running speech using knowledge-based cues. Acoustic attributes are chosen to capture four categories of cues: amplitude and energy of burst, formant movement of adjacent vowels, spectrum of noise after the release, and some temporal cues. Correlation analysis shows no redundant information among cross-category at...

Publication date: 2008